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Abstract 



1 Introduction 



O 

(N 

S3 . 

We adapt the construction of subsets of {1,2,..., N} that contain no fc-term arithmetic 
progressions to give a relatively thick subset of an arbitrary set of N integers. Particular 
examples include a thick subset of {1, 4, 9, ... , TV 2 } that does not contain a 3-term AP, and a 
positive relative density subset of a random set (contained in {1, 2, . . . , n} and having density 
cn" 1 /^ 1 )) that is free of fc-term APs. 

H 

+-> 

For a finite set JV (whose cardinality we denote by N), set r^jV) to be the largest possible size 
of a subset of j¥ that contains no fc-term arithmetic progressions (fc-APs). A little-known result 
of Komlos, Sulyok, and Szemeredi [KSS75J implies that 

m 

>; r k {Jf) >Cr k ([N]), (1) 

"xl" . 

for an explicit positive constant C. Abbott |Abb90j reports that their proof gives C = 2 -15 , and 
indicates some refinements that yield C = 1/34. 
\ In this work, we focus on the situation when jV itself has few solutions and we can give bounds 

on rfc(«/K) that are much stronger than those implied by Eq. ([T|) and the currently best bounds on 
r k([N]). In particular, we adapt the Behrend-type construction [Q'BlOj of subsets of [N] without 
fc-APs to arbitrary finite sets jV . We draw particular attention to subsets of the squares and to 
subsets of random sets. As the statement of our theorem requires some notation and terminology, 
^ ■ we first give two corollaries. 

Our first corollary brings attention to the fact that while the squares contain many 3- APs, they 
also contain unusually large subsets that do not. Here and throughout this paper, exp(x) = 2 X and 
log a; = log 2 (x). For comparison, rs([N]) > N exp ( — 2 \/2i/Iog + \ log log N). 

Corollary 1. There is an absolute constant C > such that for every N there is a subset of 
{1,4,9,..., A^ 2 } with cardinality at least 

C ■ N ■ exp (-2v^v / l°glogiV + - log log log N) 

that does not contain any 3-term arithmetic progressions. 

Our second corollary identifies sets that have subsets with no A;- APs and with positive relative 
density . 
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Corollary 2. For every real ip and integer k > 3, there is a real 5 > such that every sufficiently 
large JY C Z that has fewer than ip\JY\ arithmetic progressions of length k contains a subset that 
is free of k-term arithmetic progressions and has relative density at least 5. In particular, for each 
5 > 0, if n is sufficiently large and JY C {1,2, . . . ,n} is formed by including each k independently 
with probability cn _1 ^ fe_1 ^ > 0, then with high probability JY contains a subset A with relative 
density 5 and no k-term arithmetic progressions. 

The structure of the proof requires us to consider a generalization of arithmetic progressions. 
A A;-term D-progression is a nonconstant sequence a\, . . . , a& whose (D + l)-st differences are all 
zero: 

D+1 (D + 1\ 

( • )a i+v = 0, {\<v<k-D-l). 

Equivalently, oi, . . . ,a& is a k-term D-progression if there is a nonconstant polynomial Q(j) with 
degree at most D and Q(i) = a% for i £ [k]. Clarifying examples of 5-term 2-progressions of integers 
are 1,2, 3, 4, 5 (from Q(j) = jf), and 4, 1,0, 1,4 (from Q(j) = (j-3) 2 ), and 1, 3, 6, 10, 15 (fromQ(j) = 

=o Qi3 % be a polynomial with degree D' > 1, so that Q(l), Q(2), . . . , Q(k) is 
a fc-term /^-progression for all D > D'. The quantity D'\qr)>, which is necessarily nonzero, is called 
the difference of the sequence, and (D', Q(l), D'\qD>) is the type of the sequence. Note that different 
progressions can have the same type: both 1,4,9,16,25 and 1,5,11,19,29 have type (2,1,2). For 
any set JY, we let Type^^^) be the number of types of fc-term D-progressions contained in JY . 
The proof of [O'BlOl Lemma 4] shows that Type j . iB (J / ) <C \JY\ diam(c/K). Since the type of a 
fc-term Z)-progression is determined by its first -D + 1 elements, we also have TYPEj^DiJY) < N D+1 . 
We define 

Tk d(J^) '■= m & x {|^| : A does not contain any fc-term ^-progressions} 



and recall the lower bound proved in [CTBlOj: 

r k , D m) 



N 

We can now state our main theorem. 



> Cexp ^-^(""i)/ 2 !)^- 1 )/™ tfk^N + ^ log log i\M . 



(2) 



Theorem 1. Let k > 3, n > 2,D > 1 be integers satisfying k > 2 n l D. Let \l/(jV) be any 
function that is at least 2. There is a constant C = C(k, D,^) such that for all JY C Z with 
TYPE*. < N^(N) (where N := \ JY\) 



r k ,D(>yY) 



> C exp ^_ n2 ( n - 1 )/ 2 L>( n - 1 )/ n 0og^(iV) + ^- log log V(N)\ 



N 

Corollary [2] is now straightforward: set D = 1 and *$>(N) = m&x{ip, 2} and take 



5 = exp (^-n2 { - n ~ 1 ^ 2 0ogC + ^- log log Cj 



to arrive at the first sentence. Considering the random set JY described in the second sentence 
of Corollary [21 for each pair (a, a + d) of elements of JY the likelihood of the next k — 2 elements 
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a + 2d, . . . , a + (k— l)d of the arithmetic progression being in jY is (cn ^) k 2 . Consequently, 
the expected number of fe-term arithmetic progressions in JV is 

(„-l/(*-l))*-2 < ^l n h/(k-l) ] 

and the expected size of ,jV is N = n ■ cn -1 /^ -1 ) = cn k ^ k ~ 1 \ We can take ^(N) to be a constant 
with high probability, and so Corollary [2] follows from Theorem [TJ 

Corollary [JJ is only a bit more involved. It is known (perhaps since Fermat, see |Con08trCon07[ 
lvdP071lBFST)3llF( )04|lgK05|IMcR,10j for a history and for the results we use here) that while the 
squares do not contain any 4-term arithmetic progressions, the 3-term arithmetic progressions 
a 2 ,b 2 ,c 2 are parameterized by 

a = u(2st - s 2 + t 2 ), b = u(s 2 + i 2 ), c = u(2st + s 2 - t 2 ), 

with s,t,u > 1 and gcd(s,i) = 1. Merely observing that s,t,u > 1,6 < N yields that there are 
fewer than 2irNlogN triples (s,t,u) with a, b, c in [N], i.e., 

Type 3i i({1,4,9,... ,N 2 }) < 2nN\ogN. 

Now, setting k = l,n = 2, D = 1, ^(iV) = 27rlog ./V in Theorem [1] produces Corollary [TJ 

Section [2] gives a short outline of the construction behind Theorem [lj which is given in greater 
detail in Section [3l We conclude in Section H] with some unresolved questions. 

2 Overview of construction proving Theorem [j] 

Throughout this work we fix three integers, k > 3, n > 2, D > 1, that satisfy k > 2 n ~ 1 D; in other 
words, one may take n = \log(k/D)~\ . 

In this section, we outline the construction, suppressing as much technical detail as possible. In 
the following sections, all definitions are made precisely and all arguments are given full rigor. 

Fix &(N), and take / CZ with \<yV\ = N, and so that Jf contains less than N^f(N) types of 
/c-term D-progressions. The parameters No,d,8 are chosen at the end for optimal effect. 

Let Aq = Rk,2D(No) be a subset of [Nq] without fc-term 2D-progressions, and 

I A) | =r kt2 D(N ). 

Consider u, a in T d (we average over all choices of oj, a later in the argument), and set 

A := {a £ JV : aw+«mod T = (xi, . . . ,Xd), \xi\ < 2~ D ~ 1 , x 2 G Annuli}, 

where Annuli is a union of thin annuli in M, d with thickness 6 whose radii are affinely related to 
elements of Aq. Set 

T := {a £ A: there is a A;-term D-progression in A starting at a }. 

Then A \ T is free of /c-term D-progressions, and so d(^V) > \A \ T\ = \A\ — \T\, and more 
usefully 

r ktD {jy) >E 5 j,s[|A|]-E G ; ) «[|r|], 
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with the expectation referring to choosing uj, a uniformly from the torus T d . We have 
Egj,ts[|^|] = EnrpEzr = [N voI(Annuli)] = N voI(Annuli). 

We also have 

where E(D',a,b) is 1 if A contains a progression of type (D',a,b), and is otherwise, and the 
summation has TYPE& £>(</K) summands. Using the assumption that Aq is free of /c-term 2D- 
progressions, we are able to bound 

E w> a [E(D', a, b)) 

efficiently in terms of the volume of Annuli and the volume of a small sphere. We arrive at 

Eoj,a[|T|] < TYPE fcjD (^K) vol( Annuli) vol(B all), 

which gives us a lower bound on r^^^yV) in terms of ^f,NQ,d,5 and Aq. The work [O'B10| gives 
a lower bound on the size of Aq, and optimization of the remaining parameters yields the result. 



3 Proof of Theorem [T] 

The open interval (a — b, a + b) of real numbers is denoted a ± b. The interval [1, IV] nZ of natural 
numbers is denoted [TV]. The box (±2~ D ~ 1 ) d , which has Lebesgue measure 2~ dD , is denoted Box^). 
We define Box = [-1/2, l/2) d . 

Although we make no use of this until the very end of the argument, we set 



d :-- 



/2 / log*(AQ 
D 



!/(«+!) 



Given x £ M. d , we denote the unique element y of Boxo with x — y S 7L d as x mod 1. 

A point x = {X\, . . . ,Xfj) chosen uniformly from Box£> has components Xj independent and 
uniformly distributed in (— 2~ D ~ 1 , 2~ D ~~ l ). Therefore, ||x||2 = Yli=i-^-f * s the sum of d iidrvs, 
and is consequently normally distributed as d — > oo. Further, H^Hl has mean n := 2~ 2D d/12 and 
variance a 2 := 2~ 4D (i/180. 

Let Aq be a subset of [Nq] with cardinality 2d([-Wo]) that does not contain any /c-term 2D- 
progression, and assume 25 Nq < 2~ 2D . We define Annuli in the following manner: 

r n_ ||2 _ 

Annuli := < x e Box D : ±^ — d e [J 

[ ' aeA 

where z £ fi ± a is chosen to maximize the volume of Annuli. Geometrically, Annuli is the union 
of \A\ spherical shells, intersected with Box^). From |O'B10[ Lemma 3], the Barry-Esseen central 
limit theorem and the pigeonhole principle yield: 

Lemma 1 (Annuli has large volume). If d is sufficiently large, Aq C [Nq], and 26 < 1/n, then 

2 

the volume of Annuli is at least - 2~ dD \AQ\5. 

5 
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Set 

A := Aip,a) = {n G jV ' : nw + amodT G Annuli}, 

which we will show is typically (with respect to uJ,a being chosen uniformly from Boxo) a set with 
many elements and few types of D-progressions. After removing one element from A for each type 
of progression it contains, we will be left with a set that has large size and no fc-term ^-progressions. 
Define T := Tip, a) to be the set 

3b 6K,D'e [D] such that Aip,a) contains 1 
a fc-term progression of type (D', a,b) y 

which is contained in A(p,a). Observe that A \ T is a subset of .Jf and contains no fc-term 
-D-progressions, and consequently d(<jV) > \ A\ T\ = \ A\ — \T\ for every uJ,a. In particular, 

r k , D (^) > [\A \ T\] = E^ [\A\ - \T\] = E^ [\A\] - E^ [\T\] . (3) 

First, we note that 

Euj.s [\A\] = P ^ [neA)= ^P«[n£i]= N voI(Annuli). (4) 

ng,yK n€,yK 

Let E(D',a,b) be 1 if A contains a fc-term progression of type (D',a,b), and E(D',a,b) = 
otherwise. We have 

\T\< E{D',a,b), 

(D',a,b) 

where the sum extends over all types (D', a, b) for which D' G [D] and there is a fc-term D'- 
progression of that type contained in ,jV; by definition there are APk^jV) such types. 

Suppose that A has a /c-term progression of type (D',a,b), with D' G [D]. Let p be a degree 
D 1 polynomial with lead term p^i = b/D'l ^ 0, and • • • ,p(k) a D'-progression contained in A. 
Then 

Xi := p(i) uJ + a mod T G Annuli C Box_d . 
We now pull a lemma from |Q'B1CH Lemma 2]. 

Lemma 2. Suppose that p(j) is a polynomial with degree D' , with D'-th coefficient p' D , and set 
Xj := uJp(j) + a mod 1. If xi,~x~2, ■ ■ ■ ,Xk are i> n Box£> and k > D + 2, then there is a vector 
polynomial P(j) = Yld=Q Pif with P(j) = Xj for j G [k], and D'\Pjy = uJD'lpD' mod 1. 

Thus, the x~i are a D'-progression in M d , say P(j) = Y^iLoPiJ* nas = % an d D'lPry = 
D'\p£)> ZJ mod 1 = buJ mod 1. Recalling that z was chosen in the definition of Annuli, by elementary 
algebra 

a 

is a degree 2D' polynomial in j (with real coefficients), and since P{j) = Xj G Annuli for j G [k], 
we know that 



«« 6 U (-^r") 

ae^o 
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for all j G [k], and also Q(l), ■ ■ ■ ,Q(k) is a 2D'-progression. Define the real numbers a,j G Aq, 
ej G ±5 by 

1 

No 

For a finite sequence (a«)f =1 , we define the forward difference A(aj) to be the slightly shorter 
finite sequence (a„+i — a v )*z\. The formula for repeated differencing is 

£(T) ( - 1)iai+ "J • 

We note that a nonconstant sequence (cij) with at least 2D + 1 terms is a 2D-progression if and 
only if A 2D+1 (cii) is a sequence of zeros. If o« = p(i), with p a polynomial with degree 2D and lead 
term p2D 7^ 0, then A 2D (a,i) = ((2D)lp2D), a nonzero-constant sequence. Note also that A is a 
linear operator. Finally, we make use of the fact, provable by induction for 1 < m < k, that 



\A m ( ai )\< 2 m_1 ^maxai - mina^ 



We need to handle two cases separately: either the sequence (aj) is constant or it is not. Suppose 
first that it is not constant. Since aj G Aq, a set without /c-term 2Z)-progressions, we know that 
A 2D + 1 (oi) ^ (0), and since (oj) is a sequence of integers, for some v 

\A 2D+1 ( ai )(v)\ > 1. 

Consider: 

(0) = A™+\Q(i)) = ±A™+\ ai ) + A 2D+1 (ej), 

JV 



whence 



JV iV 



Since |ej| < <5, we find that 

2D+1 



£( 2 r<-^ 



< 2 2D+1 5, 



|A 2D+1 ( ei )W| 

1 i=0 

and since we assumed that 2<5iVo < 2~ 2D , we arrive at the impossibility 

— < |A 2D+1 (e,)(^)| < 2 2D+1 5 < 2 2D ■ — = — . 
N ~ 1 1 l,K n ~ N No 

Now assume that (aj) is a constant sequence, say a := aj, so that 



for all j G [fc]. This translates to 



ITO)lll€A*-(«-^y±^- 

Clearly a degree 2D' polynomial, such as H-PC?)!! 2 ;, cannot have the same value at 2D' + 1 different 
arguments; we pull now another lemma from [O'B10|, Lemma 1] that quantifies this. 
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Lemma 3. Let 5, r be real numbers with < 5 < r, and let k,D be integers with D > 1, k > 2D + 1. 
IfP(j) is a polynomial with degree D, andr—5 < \\P(J)\\2 < r+5 for j £ [k], then the lead coefficient 
of P has norm at most 2 D (2D)! -1 ^ 2 \Z~5. 

Using Lemma O the lead coefficient Pry* of P(j) satisfies 

\\D'\P D ,\\ 2 < D'\2 D '{2D')r 1/2 V6o Z < VFaS, 

where F is an explicit constant. We have deduced that E(D',a,b) = 1 only if 

auo + a mod 1 6 Annuli and \\bZJ mod 1 1 1 2 < V Fo~5. 

Since a is chosen uniformly from Boxo, we notice that 

Pa [auJ + a mod 1 G Annuli] = vol Annuli, 



independent of 57. Also, we notice that the event {\\buJ mod 1 1 1 2 < VFa5} is independent of a, and 
that since b is an integer, ZJ mod 1 and buJ mod 1 are identically distributed. Therefore, the event 
{\\buj mod 1|| 2 < y/FaS} has probability at most 



where Ball(x) is the (f-dimensional ball in M. d with radius x. It follows that 
Pnj,a [E(D',a,b) = l] < vol Annuli • vol Ball(V Fa5), 

and so 

Eu,q [|T|] < TYPE fcjD ( ! yf ) vol Annuli • vol Ball(\/ Fo~5). (5) 
Equations ((3|), (|4|), and ([5]) now give us 



> voI(Annuli) fl - TYPE fc.g(^) vo1Ball(v a FW)) 



r fc ,p(iV) 

iV - " "" V ' A' 
Setting 

- 2ed ^ r(ci/2) 2 / d / TYPE fc;D (^) \ - 2/d ^ rf 1 / 2 

~vrFo-Vd + 2y 2eci V N J ^(N) 2 / d 

we observe that 

l-^tf^VQlBALLCv^) 



N y ' d + 2 



Now, 



> vol Annuli 



iV - d + 2 

^2- dD 8\A \ 

> 2- dD d 1/2 ^{N)- 2/d \A \ 



2 1 
C exp ( -dD - - log *(iV) + - log d + log | A) 
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Recall that we set 



d := 



W2 fiogm)} 1/(,t+1) 



If 2D < k < AD, we take Nq = 1 and Aq = {1} to complete the proof. If k > AD, we set 

i m(N) 2 i d 



Nn := C- 



and use the bound 



\A \ = r kt2D (N ) > CN exp ^-n2^/ 2 (2D)^/ n (\og N ) l ^ n + i- log log iV ) 
proved in [O'BIOJ, to complete the proof. 



4 Unanswered questions 

Kolountzakis [personal communication] asks whether 

r 3 ,i([N]) = min{r 3) i(^): / a, = N}. 

More generally, which set JV (for fixed k,D,N) minimizes r*. It is not even clear to this 

author which set maximizes TYPE^ d {JV ) , nor even what that maximum is, although the interval 
[N] is the natural suspect and has TYPE fcjD ([iV]) < 2 D+1 N 2 . 

We doubt that there is a subset of the squares with positive relative density that does not 
contain any 3-term arithmetic progressions, but haven't been able to prove such. We note that 
there are 4-term 2-progressions of positive cubes: 3 3 , 16 3 , 22 3 , 27 3 is the image of 0,1,2,3 under 
Q(x) = ^4px 2 + ^p-x + 27. For which k,D,p are there k-tevm ^-progressions of perfect p-th. 
powers, and when they exist how many types are there? 
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